The COMLEX Syntax Project
Abstract
Developing more shareable resources to support natural language analysis will make it easier and cheaper to create new language processing applications and to support research in computational linguistics. One natural candidate for such a resource is a broad-coverage dictionary, since the work required to create such a dictionary is large but there is general agreement on at least some of the information to be recorded for each word. The Linguistic Data Consortium has begun an effort to create several such lexical resources, under the rubric "COMLEX" (COMmon LEXicon); one of these projects is the COMLEX Syntax Project.
Similar resources
The Comlex Syntax Project: The First Year
We describe the design of Comlex Syntax, a computational lexicon providing detailed syntactic information for approximately 38,000 English headwords. We consider the types of errors which arise in creating such a lexicon, and how such errors can be measured and controlled.
Tagging as a Means of Refining and Extending Syntactic Classes
Comlex Syntax is a moderately-broad-coverage English lexicon (with about 38,000 root forms) being developed at New York University under contract to the Linguistic Data Consortium; the first version of the lexicon was delivered in May 1994. The lexicon is available to members of the Linguistic Data Consortium for both research and commercial applications. It was developed for use in processin...
The Influence of Tagging on the Classification of Lexical Complements
A large corpus (about 100 MB of text) was selected and examples of 750 frequently occurring verbs were tagged with their complement class as defined by a large computational syntactic dictionary, COMLEX Syntax. This tagging task led to the refinement of already existing classes and to the addition of classes that had previously not been defined. This has resulted in the enrichment and improv...
Comlex Syntax: Building a Computational Lexicon
We describe the design of Comlex Syntax, a computational lexicon providing detailed syntactic information for approximately 38,000 English headwords. We consider the types of errors which arise in creating such a lexicon, and how such errors can be measured and controlled. 1 Goal The goal of the Comlex Syntax project is to create a moderately-broad-coverage lexicon recording the syntactic...
How to predict USMLE scores from COMLEX-USA scores: a guide for directors of ACGME-accredited residency programs.
CONTEXT Graduates of colleges of osteopathic medicine (COMs) frequently apply to residency training programs accredited by the Accreditation Council for Graduate Medical Education. However, students who have taken the Comprehensive Osteopathic Medical Licensing Examination (COMLEX-USA) rather than the United States Medical Licensing Examination (USMLE) may encounter a selection bias when applyi...